0^ 



O 



a 



o 



X 



cd 



Statistical mechanics of random two-player games 

J. Berg * 

Institute for Theoretical Physics 

Otto-von-Guericke University, Postfach 4ISO, 

D-39016 Magdeburg, Germany 



Using methods from the statistical mechanics of disordered systems we analyze the properties of 
bimatrix games with random payoffs in the limit where the number of pure strategies of each player 
tends to infinity. We analytically calculate quantities such as the number of equilibrium points, 
the expected payoff, and the fraction of strategies played with non-zero probability as a function of 
Tj , the correlation between the payoff matrices of both players and compare the results with numerical 

simulations. 
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Cvl . . 

The adaptation to the behaviour of others and to a complex environment is a process central to economics, so- 
ciology, international relations, and politics. Game theory aims to model problems of strategic decision-making in 

^ mathematical terms: Two or more interacting participants, called players, make decisions in a competitive situation. 

I Each player receives a reward, called the payoff, which not only depends on his own decision, but also on those of 

^ : the other players. In the generic set-up a number of players choose between different strategies, the combination of 

'^ ' which determines the outcome of the game specified by the payoff to each player. Each player strives to achieve as 

^ I large a payoff as possible. One of the cornerstones of modern economics and game theory is the concept of a Nash 

2 ' equilibrium ||I|, for an introduction see also H. A Nash equilibrium (NE) describes a situation where no player can 

H \ unilaterally improve his payoff by changing his individual strategy given that the other players all stick to their strate- 

'^ I gies. However this concept is thought to suffer from the serious drawback that in most games there is a large number 
of Nash equilibria with different characteristics but no means of telling which one will be chosen by the players, as 
would be required of a predictive theory. 

Q ' This conceptual problem already shows up in the paradigmatic model of a bimatrix game between two players X 

'— ', and Y where player X chooses a so-called pure strategy Xi G (1 . . . N) with probability Xi > and player Y chooses 

^^ ■ strategy Yj e (l...iV) with probability j/j > 0. The vectors x = (xi, ...,xn), y = (yi,...,yjv) are called mixed 

►^ ' strategies and are constrained to the {N — l)-dimensional simplex by normalization. For a pair of pure strategies {i,j) 

r — ■ the payoff to player X is given by the corresponding entry in his payoff matrix a^ whereas the payoff to player Y is 

If) I given by bij. The expected payoff to player X is thus given by i'j;(x, y) = J2i Xiaijijj and analogously for player Y. 

C^ A Nash equilibrium (x*,y*) is defined by 

^; i/^(x*,y*)=maxx J^'^(x,y*) 

0>: ^^nx*,y*)=maxy^nx*,y). (1) 

-4-^ ' . . . 

C^ ' The condition for a NE with a given set of expected payoffs v^ and v^ may be written as 

g: 

I ■ ^ flyl/j - i^^ < Xi > Xi(^ a^jyj - i/^) = Vi 

O : Y. ^*^y" -"" <0 % > yj{J2 ^*^*^" - ^'') = Vj , (2) 



where we have dropped the * indices for simplicity. The first column ensures that there are no pure strategies (and 
thus also no mixed strategics) which will yield a payoff larger than i/^ to player X and v"^ to player Y. Thus no 
^ ' player will have a reason to deviate from his mixed strategy. The second column ensures that the mixed strategies 
may be interpreted as probabilities and the third ensures that v^ = J2i j Xittijyj and analogously for player Y. In this 
situation there exists no mixed strategy which will increase the expected payoff to X if F does not alter his strategy 
and vice versa for Y. Nash's theorem |l| states that for any bimatrix game there is at least one NE. 

The third column in (0) states that whenever Xi is strictly positive, ^ ■ aijyj = v^ and whenever ^ OyJ/j ~ '^^ is 
strictly negative, Xi is zero. Thus for a given set of strategies played with non-zero probability (out of 4^ possible 
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choices) , the values of all non-zero components of a mixed strategy can be determined by solving the resulting linear 
equations ^ o^ijUj ^ v^'ii '■ Xi > and ^^ Xibij = v^ \/j : yj > subject to the normalization condition. 

Apart from applications in economics, politics, sociology, and mathematical biology, there exists a wide body of 
mathematical literature on bimatrix games concerned with fundamental topics such as exact bounds for e.g. the 
number of NE and efficient algorithms for locating them B . For games even of moderate size a large number of 
NE are found, forming a set of disconnected points. In general the different NE all yield different expected payoffs to 
the players. 

However many situations of interest are characterized by a large number of possible strategies and complicated 
relations between the strategic choices of the players and the resulting payoffs. In such cases it is tempting to model 
the payoffs by random matrices in order to calculate typical properties of the game. This idea is frequently used in 
the statistical mechanics approach to complex systems such as spin glasses MM, neural networks m, evolutionary 
models |8|j^ , or hard optimization problems ||l^,|ll[ • Recently this approach has been used to investigate the typical 
properties of zero-sum games |12 obeying a^ — —hij. A partial analysis of bimatrix games using the so-called 



annealed approximation has been given in [h3| 

In this paper we investigate the properties of Nash equilibria in bimatrix games with a large number of pure 
strategies and random entries of the payoff matrices. In this approach characteristics of the game are encoded in 
the distribution of payoff matrices - with only a few parameters - instead of the payoff matrices themselves. Using 
techniques from the statistical mechanics of disordered systems such as the replica-trick we calculate the typical 
number of NE with a given payoff. 

The paper is organized as follows: Having set up the probability distribution of payoffs to be considered, we 
construct an indicator function for NE which will allow us to count the number of NE. Then the average of the 
logarithm of the number of NE over the disorder will be calculated. The solution is discussed both in game theoretic 
and in geometric terms and is compared with the results of numerical simulations. Finally, we give a summary and 
an outlook to future developments. 

I. THE DISTRIBUTION OF PAYOFF MATRICES 

We consider bimatrix games with square payoff matrices {a^j, 6y } i,j = 1 ... TV, where the thermodynamic limit 
consists of A^ — > cx). We assume that the entries of the payoff matrices at different sites are identically and inde- 
pendently distributed. Since the two payoff matrices may be multiplied by any constant or have any constant added 
to them without changing the properties of the game in any material way there is no loss of generality involved in 
considering payoffs of order A~^ " and of zero mean. In the thermodynamic limit one finds that only the first two 
moments of the payoff distribution are relevant, as is generally the case in fully connected disordered systems described 
by mean-field theories. Hence the entries of the payoff matrices may be considered to be Gaussian distributed. Then 
the only property of the distribution of payoffs which is not fixed by these specifications is the correlation k between 
entries at the same site of the two payoff matrices. 

We thus choose the entries of the payoff matrices to be drawn randomly according to the probability distribution 

(s X su x^ TT ^ / ^(4 ' ^^^^i^v + 4) 1 .o^ 

P({«.}, m) = n ^^7^=^ -P I ^^r-7^^ 1 ' (3) 

i.e. a Gaussian distribution with zero mean, variance 1/N and correlation {{aijbki)) — nSikSji/N for all pairs {i,j) and 
{k, I). Here and in the following, the double angles denote the average over the payoff distribution (||). For k = — 1 (^) 
includes a Dirac-delta (5(a.y -I- bij) corresponding to a zero-sum game and we recover the situation of [n2l as a special 
case. K = corresponds to uncorrelated payoff matrices and k = I is the so-called symmetric case a^ == bij where the 
two players always receive identical payoffs. 

Thus the parameter k describes the degree of similarity between the payoffs to either player and can be used to 
continuously tune the game from a zero-sum game to a purely symmetric game. In the former case, the gain of one 
player is the loss of the other, so generally negative k correspond to a competitive situation, whereas for positive k 
there are many pairs of strategies which are beneficial to both players. 

II. THE ENTROPY OF NASH EQUILIBRIA AND THE INDICATOR FUNCTION 

In this section we construct an indicator function which is zero at a NE with payoffs v^ and v^ to players X and 
Y respectively, and non-zero everywhere else. This function will be made the argument of a properly normalized 



Dirac-delta function. Integrating the Dirac-delta function over the mixed strategies of both players, we are effectively 
counting the number of NE with the specified payoffs. From the resulting spectrum of NE the statistical properties 
of NE in bimatrix games may be deduced. Since we expect the number of NE to scale exponentially with the size of 
the game our central tool of investigation will be the entropy of Nash equilibria defined by S{v^ ^ v^) — -^ h\J\f{v^, v^) 
where J\f{i'^, v^) is the number of NE with the specified payoffs per unit interval within a small interval around v^ 
and i^y . Since NS{v^ ^ v^) is expected to be an extensive quantity, we may assume that S{v^ , v^) is self-averaging and 
in the thermodynamic limit the average value of the entropy will be realized with probability one. Hence the central 
goal of our calculation will be to evaluate {{S{v^ , ^^)))- 
In this framework the total number of NE is given by 

N^ /d^-di/^e^^t"^-''") , (4) 



so in the thermodynamic limit the NE will be exponentially dominated by the maximum of the curve S := 
TaaxS^p^ jV^). This implies that a randomly chosen NE will yield the payoffs where the maximum occurs with 
probability 1. On the other hand, the line where S{i'^, v^) — Q delimits the smallest and the largest values of i^^, v^ 
for which there is still an exponential number of NE. 

The three expressions may be encoded in a single condition [|l4[ by introducing the variables x and y with 
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- i/^ Xi^O 


J2^ X^biJ - 





yj - \ v^ ^.h.. - ,,v J. - n ■ (^) 



Condition (0) may be written as 






II = yM-yj) - (E, i^Q{ii)b^, - ^^) = , (6) 



so for positive Xi, Xi = Xi, whereas for negative Xi we have Xi — E , '^ijVj^iyj)^^^ ■ Furthermore we have Xi = Q if ii < 
and ^ . aijyjQ{yj)—v^ = for Xi > 0. The condition Xi{^- aijyj—v'-^) = is thus satisfied automatically. Analogous 
relations hold for player Y . The new variables therefore serve as a convenient tool to encode the 'complementary' 
quantities xi and ^ o-ijUj — v^ va a, single variable. Analogous relations hold for player Y . The density of NE with 
payoffs v^ and v^ may thus be written as 

M{v^,vy)^ j d^,{i)d^,{y)Y[5{It)Y[6{Iy)\\^^^\\ , (7) 

^ 3 

where the mixed strategies are rescaled to Ei ^i — Ej Vj = N so we define the measure dji as 

d^l{i)=\ldx,5\Y,i^^{i^)~N\ . (8) 

This scaling of the mixed strategies assumes that the an extensive number of strategies are played with non-zero 
probability, so the individual terms Xi and yj are all of order 0(1). The integrals over x, y effectively amount to 
choosing a set of strategies with xi > Q and yj > 0, solving the resulting linear equations for the components played 
with non-zero probability, and checking if this candidate for a NE fulfills the conditions (0) . It may thus be viewed 
as performing the so-called support enumeration algorithm analytically Q . 

III. CALCULATION OF THE TYPICAL NUMBER OF NASH EQUILIBRIA 

In this section we calculate the average of Sl^x, i^y) over the disorder (H). In order to represent the logarithm of (M) 
we use the replica-trick In A/" = lim„_>o ^•^" 3'iid compute the average over A/"" for integer n taking the limit n —^ 
by analytic continuation at the end. Using integral representations of the Dirac-delta function we obtain 



■^ " dn{Si'')dxf yj f dn{y"^)dy'} 



Af-(u^,^y) = f[f^t^^}^l[r. 



2tt 

-z E y"0(-^")y," + * E i"0(^n^^..i," - ^y" E y°} 

(iidct(i?)iir, (9) 

where a runs from 1 to n. The most awkward term of this expression is the absolute value of the normalizing 
determinant, i.e. the Jacobian matrix of I^,!** given by 



9(i,y) ^-&.,e(f,) %'e(-j7,); • ^'''^ 

which arises from the coefficients of i, and y in I^ and I^. Since we are only interested in the absolute value of the 
determinant, we are free to interchange rows and columns of this matrix. Rearranging the rows and columns of D 
such that the y-j^N strategies with x^ > and the -pyN strategies with j/j > are grouped together, one finds that 
only the resulting quadratic submatrix of size N{px + Py) by N{px + Py) contributes to the determinant of D. From 
(0) one finds that the distinction between p^. and Py is immaterial since the number of strategies played by player X 
at any NE always equals that played by Y and the determinant of D is zero for pa; ^ Py In the following we will 
assume that -^ ln(det D) is a self-averaging quantity depending on p^ = Py Splitting off the normalizing determinant, 
the average over the payoffs may now easily be performed, details of the calculations are given in appendix ^. The 
average over the disorder introduces a coupling between the replicas and one introduces the symmetric matrix of the 
overlaps between mixed strategies of each player as order parameters, 

as well as their conjugates %j^ ■ At non-zero values of k we also obtain terms which couple the phase-space variables 
Xi to the auxiliary variables Xi and similarly for player Y, so we also introduce the order parameters 

Kb^j^Yl 'i^s^'Mit) , Kb = ^T. 'y'MQir,) ■ (12) 

i J 

Similarly, in order to include the normalizing determinant we introduce the order parameters 

Pa = j^T.^ii^)^ pl-j^J2^iyl) (13) 

giving the fraction of strategies played at a NE. 

Anticipating the limit n — > 0, the quenched average of the normalizing determinant may be computed using results 
from the theory of random matrices as outlined in appendix H giving (( ln(|| det(Z?)||))) = A'^p(lnp — 1). 

We finally obtain 

m iy ''^')»=ijLy ^^-j^vvj ^i^7(^ny ^^^v^^^^'^-p'^ny ^-^ 

exp{-7V E %^%^ - -^ E ^a^^a. + ^ E^a'^Pa'^ + ^ E ^"'} 
a>h a.b a a 



where 



exp{iV[G--|-G^]}((||det(I?)|l")), (14) 

G- = InJ] / ^Y^exp{/:-({x^xn)} 

^^— explE rati^QiinS^'Qii") + ^ E K,^S:'^S:''ei£') - - E iL^:^ 



x' 



-i^5°e(-i")£° 



w^ Y^ x" 



Y^Elx'^Qii'^) 






Pl^ii")} 



Gy = ln[| 



27r 



exp{£^({y^y'^})} 



i^n / ^^ e^piE €,rm'')v'm') + ^ E K^r e(r )*y'' - J E -^a"^"^ 



a>6 



a,b 



a,b 



-z J2 r^i-rw -^'^'T.y'-T. ^ir^m - Ep'©(y")} 



(15) 



In the thermodynamic hmit TV — > c» the integrals over order parameters in ( p4| ) may be performed by saddle point 
integration. In order to be able to analytically continue the saddle-point to n ^ we choose the replica-symmetric 
(RS) ansatz for the order parameters 



(16) 
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qf ~ jj J2i ^i^i denotes the self-overlap of the mixed strategies of player X, whereas q^ — jj X^i ^l^f characterizes 
the overlap between the mixed strategies corresponding to two distinct NE and analogously for player Y . 



The integrals over £°, i", y", y"" may be evaluated and the limit n ^ may be taken as outlined in appendix A 1 . G' 



and Qy evaluated at the RS-saddle point are symmetric with respect to an interchange of the players X and Y . Thus 
the maximum of S{v^ ,vy) occurs at equal payoffs and in the thermodynamic limit NE with any other combination 
of payoffs will be exponentially rare by comparison. Hence we may restrict our discussion to the case v^ = i/V = v, 
where all order parameters are symmetric under interchange of the players. 
We thus obtain the average entropy of the number of NE within RS 
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(17) 



(18) 



(19) 



where K(x) is a shorthand for H(x)/G(x) with G(x) — — i= exp(— a;2/2), and H{x) — f°° dyGiy). The extremum is 
to be taken over all order parameters, The saddle-point equations corresponding to (O) may be solved numerically, 
their solutions will be discussed in detail in section IV . 



A. The distribution of the strategy strengths and the potential payoffs 



In this section we calculate the distribution of strategy strengths Px{x) — {{j^'Yli^i^i ~ ^))) ^^'^ the potential 
payoffs px^ (A) — ((-^ J2i ^iJ2i ^ijVj ^ ^))) ^^ ^-^1. Due to the symmetry of (p^) under an interchange of players for 



^x _ ^y j^ jg sufRcient to calculate these distributions for a single player only. We make use of the set of variables Xi 
introduced in section ^ since the distribution pi{x) is equal to Px{x) for a; > and equal to px^{\ — v'^) ior x < 0. 



rO 



By the same token, the fraction of strategies with Xi = is equal to /__^ dxpx{x) — 1—p and the fraction of potential 





payoffs with Af = v^ is Jg dxpx{x) = p 



Since all pure strategies are equivalent after averaging over the payoffs (translation- invariance), we calculate 

3(I",I") 
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a=l '' i,a j,a a V ; ^ 7 

In order to be able to perform the average over payoffs occurring in both the numerator and the denominator, we 
have represented the denominator by n — 1 replicas. The average over payoffs now proceeds exactly as in the previous 
section. Introducing the matrices of order parameters qat again, the i = 1 term may be split off from the saddle point 
integral without distorting the saddle point in the thermodynamic limit. Taking the replica symmetric ansatz one 
obtains 

p,(x)= limj] / —^ exp{/:-({^^xn)}<5(il-^) (21) 



(x+v-,/q^b)'^ 



fdadbp^Ja,b)- 1 exp{- ^"tr^^7 } 
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where /^^({J^jX"}) was defined in (p^), the order parameters take on their saddle point values, and we have dropped 
the pla yer-in dices of the order parameters. These functions have to be evaluated numerically and will be discussed in 
section |IVC| . 

The same procedure may be used to calculate another quantity of interest, namely the fraction w of pure strategies 
which are both played with non-zero probability in two randomly chosen NE equilibria. Like goi this quantity is a 
measure of the degree of similarity of two randomly chosen NE. However w does not directly depend on the self-overlap 
of the mixed strategies and may serve to test if there are strategies which are more likely to be played at a NE than 
others. From (|lj) and (|l^) one obtains 
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B. The stability of the replica-symmetric saddle point 

The results for the quenched average were derived on the basis of the replica-symmetric ansatz (uw. In this section 
we investigate the stability of this ansatz with respect to small fluctuations around ( |l6|) in order to check if this ansatz 
is at least locally stable. We restrict ourselves to the special case k = 0, where the payoff matrices are uncorrelated and 
the order parameters i?^^, i?^^ do not arise jl^. We consider small transversal fluctuations around the RS saddle-point 
and expand (n4) to second order in these fluctuations to obtain 

S = Sns + ^A^Af A + 0{A^) , (23) 



where A denotes a vector of small fluctuations in the ofl-diagonal elements of the order parameters ^afc' 9ab' ^afc' Cfc 
and M is given by 
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(24) 



Due to the symmetry of (14) under an interchange of the players at the RS saddle-point, only 3 different non-trivial 



submatrices need to be evaluated. The criterion for RS to be locally stable needs to be determined by working out 
the paths of integration in the complex plane of q^ and q^. Denoting the replicon-eigenvalues of g?^ g.^ , „ ^ '?"^ , 



and 






by Ai, A2, A3 respectively one obtains the criterion for the local stability of RS 



-— (A1A2 — (A3 
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if) <0 
If) <0 



(25) 



For details of the calculation see appendix 



IV. DISCUSSION OF THE RESULTS 

The quantity Sn{v) contains a wealth of information. We begin by discussing the general shape of Sk,{v) and the 
number of NE as a function of k, then turn to the statistical properties of NE and give a geometric interpretation of 
the results, and finally discuss the distribution of potential payoffs and the strategy strengths. 



A. Sk{v) and the number of NE 



The expression ( p7| ) for S^ {v) defines a family of curves with a pronounced maximum shown exemplarily for k = 
in figure 




FIG. 1. The results of the quenched averages of Sk(i') for k = —0.8 and n — respectively (bottom and top respectively). 
For n — replica symmetry is locally stable for u > 0.67 as indicated by the black dot. 



As argued in section || in the thermodynamic hmit the maximum of Sk{i^) dominates the spectrum of NE. 

Another point of interest is the value of v where Sk{v) crosses the 5 = axis. Due to the symmetry of S{v^ , v^) 
under an interchange of the players this point indicates the NE with the maximum sum of the payoffs. For k = — 1 
it takes on the value and increases monotonously with k. At k = Kc it diverges to infinity; Si^{v) no longer crosses 
the 5" = axis. In this case there is an exponentially large number of NE offering an arbitrarily large payoff to either 
player, where an arbitrarily small fraction of strategies are played. From the annealed approximation one obtains 
Kc ~ —.59, the corresponding result from the RS expression for the quenched average is Kc ~ —.58. This effect may 
be explained as follows: At large values of n players may pick a few of the pairs of strategies {i,j), which offer a large 
payoff to both of them and play them with a large probability. An exponential number of NE may be constructed in 
this way, even though their number is exponentially small compared to the total number of NE. 

The entropy of NE given by the maximum 5^ of S'k {v) is shown in figure 0. 




FIG. 2. The RS entropy of NE Sk as a function of k (solid line). The numerical results stem from enumerations with system 
size N=18 averaged over 100 samples, the error-bars denote the statistical error. The inset shows the finite-size effects for the 
case K = Q. So is plotted against 1/N, the analytic result for Af ^ cx) is indicated by the filled diamond. 



We find 5^=-! = since there is only a single equilibrium point for zero-sum games. S^ increases with k, so for all 
K > — 1 the typical number of NE scales exponentially with N. The maximum of the typical number of NE is reached 
for the case of symmetric games, where 5^=1 ^ 0.358. This result may be compared with a rigorous upper bound 
for the maximal number of NE in a bimatrix game derived using geometric methods MM: For any non-degenerate 
iV-by-iV bimatrix game with large N there are at most e"-^^^^ equilibrium points. As expected, the typical-case 
scenario does not saturate this bound, at least not for the distribution of payoffs considered here. Nevertheless for 
K > — 1 the typical number of NE investigated here and the maximal number of NE both scale exponentially with N. 

The increase of the number of NE with the correlation between the payoff matrices may be explained as follows: 
As will be discussed in the next section, the payoff v to both players increases with k. For increasing values of 
1/ = i/^ = i^y the necessary (but not sufficient) conditions for a NE 



y aiji/j < u^ Xi > OVi 
i 

y^ Xibij < u^ Hj > OVj 



(26) 



become increasingly easy to fulfill. In fact for v ~ only a single point on the simplexes of the two players fulfills 
(Pq ) , whereas for large i^ a correspondingly large section of the simplexes qualify as a candidate for equilibrium points 
p2[ . As a result the number of points which apart from (Bq) obey (^ «ijj/j ~ v^)xi = and (^^ Xibij — v^)yj = 
and thus constitute NE increases with n. 



B. The statistical properties of Nash equilibria 

In the thermodynamic hmit not only the number of NE will be dominated by the maximum of S^ (i^) , but a randomly 
chosen NE will also give the payoff i/ == i/^ = i/^ = argmax5'K(i') with probability one, because the number of NE with 
this payoff is exponentially larger than the number of all other NE. Similarly, the selfoverlap, the mutual overlap, 
and the fraction of strategies played with non-zero probability will take on their saddle-point values evaluated at the 
maximum of Sk{i') with probability 1. 




4^ ^^ -»^ -*t 



FIG. 3. The payoff v (solid line) and the fraction p (dashed line) of strategies played with non-zero probability of the typical 
NE. The analytic results are compared with numerical simulations for A'^ = 50 averaged over 200 samples. 




FIG. 4. The self-overlap of mixed strategies qi (solid line), the overlap go between the mixed strategies of different NE 
(dashed line), and the ratio qo/qi (long-dashed line). The analytic results are compared with numerical simulations for A'^ = 50 
averaged over 200 samples for gi and A*' = 18 averaged over 100 samples for go- 

Figure ^ shows the payoff dominating the spectrum of NE and the corresponding fraction p of strategies played with 
non-zero probability, whereas the self-overlap of mixed strategies gi and the overlap qo between the mixed strategies 
of different NE are shown in figure |j. 

At K = — 1 we recover the results for zero-sum games with qi — qo = n, v = 0, and p — 1/2 llj]. As k rises, the 



payoff increases. This effect may be understood as follows: At increasing k the outcome of a pair of strategies (i, j) 
which is beneficial to player X say, tends to become more beneficial to player Y . As a result players focus on these 
strategies and the payoff at a NE to both players rises. By the same token, the fraction p of strategies which are 
played with non-zero probability at a NE decreases with k and the self-overlap qi of the mixed strategies increases. 
The geometric structure of the set of NE may be elucidated by considering the mutual overlap qo — jj X^i ^1^1 
between mixed strategies of different NE. At n — —1, where there is only a single NE, go equals the self-overlap 
qi. After an initial increase q^ decreases with increasing k. The initial increase of go is due to the rapid increase of 
the lengths of the mixed strategy vectors and is thus not seen in the ratio between the overlaps. This result may 
be interpreted geometrically in that the NE become more and more separated with increasing k, and for k — > +1 
they end up in nearly uncorrelated positions, {x^x^) — {x^){x^) = .21. At the same time an increasing fraction of 
components of the mixed strategies have Xi = 0, i.e. lie on an edge of the simplex. 



w,p 




FIG. 5. The fraction of strategies w (top) played in both mixed strategies of two randomly chosen NE and the square 
p^ of the fraction of strategies played at a single NE (bottom) against k. The analytic results are compared with numerical 
simulations for Af = 50 averaged over 200 samples for p^ and A'^ = 18 averaged over 100 samples for w. 

Even though the NE spread over the simplex with increasing k, players still tend to focus on specific strategies. 
This may be seen by comparing the fraction of strategies w played in both mixed strategies of two randomly chosen 
NE with the corresponding result p^, which would result if players chose p strategies to be played with non-zero 
probabilities at random. From figure H one finds that although w decreases with n consistent with the spread of NE 
over the simplex it always remains above p^ . This behaviour is consistent with the idea that with increasing k players 
focus on pairs of strategies which are beneficial to both, of which there is a large number for large values of k. 

Since NE are isolated points, replica symmetry describes a set of equilibrium points which are distributed uniformly 
over a part of the simplex with opening angle arccos(qo/'Zi)- A replica-symmetry breaking scenario would involve 
clusters of NE, and maybe even clusters of clusters, so an ansatz explicitly including more than two overlap scales 
would have to be employed along the lines of the Parisi-scheme [g|. However, at least for k = 0, we found that 
replica symmetry is locally stable for v > 0.67 and most importantly at the maximum of the curve. RS remained 
locally stable across the range of v investigated, nevertheless RS may become locally unstable again at sufficiently 
large values of v. Thus we may conclude that for typical NE aX k — Q the replica-symmetric ansatz is self-consistent. 
Since we know from the results of |1J] that RS is marginally stable at k = — 1 one may in fact speculate that for the 
typical NE the RS scenario holds across the entire range of k. Nevertheless there may well be distributions of the 
payoffs which lead to non-uniform distributions of NE and to replica-symmetry breaking, presumably distributions 
with large values of k, or with correlations between the entries of the payoff matrices at different sites. 



C. The distribution of potential payoffs and the strategy strengths 



Figure o shows the distribution of strategy strengths Px{x) = {{-^ X^i '^i^i ^ ^))) (^ > 0) ^^^"^ ^^^ potential payoffs 
PAt(A) = {{jf^i'^Cl2jO'ijyj ^ '^))) i^ < 0) calculated in section III A|. The decrease of the fraction of strategies 
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played with non-zero probability, /„ dxpx{x) = p with k is clearly visible. One also finds a marked tendency for both 
players to use large values of Xi and yj for decreasing values of p, as is demanded by the normalization condition. 

One also observes the formation of a 'shoulder' in the distribution of Pxix) — p\^{X — i^^) {x < 0) centered at —i^. 
It shows that the distribution of ^ o,ijyj remains peaked at zero leading to the formation of the shoulder sd x = —v 
as v increases. 




FIG. 6. The distribution of the potential payoffs 
1 (x < 0) and the strategy strengths (x > 0) for 
K — —1, 0, 1 from top to bottom 



D. Comparison with numerical results 

The numerical results for figure y, |6|, go in ^j and w in ^ were obtained by using so-called vertex enumeration to 
enumerate all NE. Since the computational effort for vertex enumeration scales with ~ 2.598^, the system size had 
to be restricted to TV = 18 and averages were taken over 100 samples, resulting in pronounced finite size effects. 
Nevertheless the increase of the number of NE with k is clearly confirmed by the simulations. 

The numerical results for figure 0, qi in figure 0, p in H, and figure were obtained by using an iterated variant of 
the Lemke-Howson algorithm pqM^ to locate a single NE and by averaging the results for A^ = 50 over 200 different 
realizations of the payoffs. Although some finite-size effects remain, there is good agreement between the analytical 
and the numerical results. 
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V. SUMMARY AND OUTLOOK 

We analysed the properties of Nash equihbria in large random biniatrix games. To this end we constructed an 
indicator function which was used to count the number of NE with given payoffs to both players. We found that the 
number of NE is exponentially dominated by NE with a certain payoff to both players, and a certain set of order 
parameters. This implies that for a randomly chosen Nash equilibrium quantities such as the fraction of strategies 
played with a given probability, the self-overlap, and most importantly the payoff to either player take on a specific 
value with probability f . 

We considered square payoff matrices and argued that for large games and identically and independently distributed 
elements of the payoff matrices at different sites {i,j), the only relevant parameter of the probability ensemble is the 
correlation between elements of the same site of the payoff matrices a and b. We then calculated the quenched average 
of the number of Nash equilibria, from which one may also deduce quantities such as the payoff, the self-overlap and 
the mutual overlap of mixed strategies at NE, and the distribution of the strategy strengths and the potential payoffs. 

We found that both the number of equilibrium points and the payoff to both players increase with the correlation 
between the payoff matrices: With increasing correlation the number of pairs of strategies which are beneficial to 
both players grows. Players may focus on these pairs and achieve a larger payoff, the fraction of strategies played 
with non-zero probability decreases accordingly. From the values of the saddle-point parameters one may also deduce 
information on the geometric properties of NE: With increasing correlation between the payoff matrices the NE spread 
out over wider regions of the simplex. These analytic results were quantitatively compared with extensive numerical 
simulations and good agreement was found. 

Another point of relevance is that for a sufficiently large correlation between the payoff matrices, an exponentially 
large number of NE appears which offer arbitrarily large payoffs (on the statistical mechanics scale) to both players. 
The number of such NE is of course exponentially small compared to the total number of NE, nevertheless these 
equilibrium points may be relevant if players are free to choose equilibrium points. 

A number of generalizations and extensions of these scenarios may be envisaged at this stage, including the investi- 
gation of bimatrix games with rectangular payoff matrices or payoff matrices with correlations between the elements 
at different sites. Furthermore, a scenario of games of several players might be extended to describe cooperative 
games, where coalitions of players pool their payoffs and seek to maximize the gain of their respective coalition. In 
this context it may also be interesting to consider the case of 0{N) players choosing between 0(f ) strategies. 

Acknovirledgments: I would like to thank A. Engel, M. Opper, and M. Weigt for fruitful discussions and the 
Studienstiftung des Deutschen Volkes for financial support. 

APPENDIX A: THE QUENCHED AVERAGE 

In the following we give a derivation of the quenched average of the entropy of NE. In order to represent the 
logarithm of M we replicate n times the expression for the number of NE M (|^) to obtain (^ . 

Treating the normalizing determinant as a self-averaging quantity, we may split off In || det(_D)|| with 

from (|9|) and separately average the normalizing determinant over the disorder. Leaving out all the rows and columns 
which have only the entry 1 along the diagonal and do not contribute to the determinant, we are left with the 
determinant of a matrix 

D'^ {%-/). (A2) 

where the matrices a' and b' are the pN by pN submatrices of the payoff matrices containing the elements with 5;^ > 
and ijj > 0. We thus calculate ((|| lndet(D)||)) as a function oi Px = Py exploiting the block-structure of the matrix D 
and using results from the theory of random matrices |l^. Since we have In || det(Z?)|| — In || det(a')|| -I- ln]|dct(fe')||, 
the correlation between the elements of these matrices has no effect. We may thus use the circular theorem [Q, which 
gives the average density p{io) of eigenvalues w of a pN by pN matrix with identically and independently Gaussian 
distributed entries with zero mean and variance N~^ 

' 1 otherwise 
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giving 

((ln||dct(£i)||)) ^2Np f dujp{u;)\\uj\\ ^Np{\np-1) , (A4) 

Js 

where the integral is over the region in the complex plane with \\oj\\ < ^. After this step, the only terms in (O) where 
the disorder is present are 

2j' a a 

exp{-l/(2A^) ^(^ x1x\ Y^ y'^QirM^i.y]) 

a,b i j 

i j i 3 

where the indices a and b denote the replicas, a,b = 1 . . .n and the average has been taken over the distribution of 
payoffs (H) . In order to obtain expressions which factorize in i and j we introduce the matrices of order parameters 

i:b = ^ E i^Qiif)i^Qiit) , €b = ^ E y"©(^;)y'©(^') 






3 



using integrals over delta functions. The last pair of order paramet ers is introduced so the normalizing determinant 
may be included as a function oi p^ and p^. This procedure turns ( |A5| ) into 

n f Kb'dfJ -pr f dRi,dRi, yr f dpi^ydpi^y 

a>b a,b ' ^ ' a 

exp{-^7V J] g-^Cf ^ -^ E ^^^^'^ + *^^ E ^'^P-^ 

a>6 a,fc a 

explj] ql,~x'^Q{i'^)x''Q{x') + z«:5] i?^,*£'^i''e(x^) - ^ J] gl^,x"i 

a>b a. 6 a.fc 



?; 






explE Cfcy'^©^ )y''©(y'') + « E ^^^r ©(r )*y'' - ^ E 9^'^^"^' 

a>b a.b a,b 

E yr©(-yr)y." - »^^ E ^? - * Epa©(r )} (at) 



a, 2 a,z 



All order parameters have been introduced via conjugate variables, except i?^'j, and R^^,, which are conjugate to each 
other. Care must be taken to scale all order parameters so they are of 0(1) in the thermodynamic limit. Expression 



(A7) may now be substituted back into (Bf). The simplex-constraint is incorporated by including yet another set of 



integrals 

n / S^ ^^p{*^ E ^a'" - * E ^^^ E ■^"©(*^") ~ * E ^a E y^^iy^^y ■ (^s) 

a a a i a i 

The integrals over i^, x° now factorize and form a product of N identical terms and may thus be written as the iV-th 
power of a single such term. The same point applies to the integrals over yf-,yf. Anticipating saddle points of the 
integrals over conjugate order parameters along the imaginary axis, we also perform a change of variables ig'^^ h ~ ^ Qab 
and analogously for R^f^, E^'^ , p^'^ . Including the normalizing determinant (A4) we finally obtain ( p^ and (p^). 

13 



1. The replica symmetric ansatz 

In the thermodynamic hmit N —i- oo the integrals over order parameters are dominated by their saddle point. Yet 
in order to carry out the replica limes n — > we have to make an ansatz for the values of the order parameter matrices. 
The simplest ansatz is the replica-symmetric one given by (fiW- Since G^ and G^ are symmetric under interchange 
of the players we may drop the superscripts x and y. We obtain 

^— exp{-- Y,{q^ + q^)i'^i'^Q[i'^) + - ^ qa3:^Q{3:^)i^Q{i'^) (A9) 

(-1 a a,b 

+k{Ri - Ro) J2 i^°i"0(i°) + kRo Y^ ix'^i^eii'') - i(gi - go) Yl *"*" 

a a.b a 



2 

a,b 



A particularly efficient way to disentangle the three sums over the replica-replica couplings is to use two coupled 
Gaussian integrals over variables termed a and b echoing the original average over the payoff matrices, which yield 

//" dx°'dx°' 1 

dadbp~,{a, 6) [] / -^-— exp{-- ^(gi + 9o)£"£"e(£'') 
a a 

+k{Ri - Ro) Y Ji"i"e(.T") ~^{qi- go) Y ^°^'' + a^^^ i°e(x°) 

a a a 

a a a a a 

:=ln f dadbpi,{a,b)Y[ fv{S:'',x"-) , 

a 

where pkia,b) with k = -7== is defined by (fsl). The resulting expression factorizes giving n identical integrals over 

V9090 ' — ^ 

X and X, which may be easily performed by considering the cases x < and x > separately. The limit n — )■ of 
( |l4| ) may now be taken yielding ([l7|)-(|l9|). 

APPENDIX B: THE STABILITY OF THE REPLICA-SYMMETRIC SADDLE-POINT 

In this section we outline the calculation of the eigenvalues of the Hessian matrix of (nj) in order to check if the 
ansatz (0|) is locally stable against small fluctuations of the order parameters. We focus on the so-called replicon 
modes | |I9| ] and restrict ourselves to the case k = 0. In this case the Hessian matrix of fluctuations of (Q) around ( p^ ) 
is given by (p4|). 

The derivatives of G^ and G^ are evaluated at the RS-saddle point. Due to symmetry of G^ and G^ under an 
interchange of the players we have to find the replicon eigenvalues of three different submatrices of M , beginning with 
„- g- — : At the replica-symmetric saddle point there are three different entries in the "^"~ ' by ^ Z^ matrix of 
derivatives with respect to the off-diagonal elements of (jab with a > b. These are 

q2q { Pi for a = c.b 



where 



= \ Qi for exactly one pair of indices equal , (Bf) 

'abdqcd I ji^ ioT a^c.b^d 



P\ = {x\x^^) ~ {XaXb){XaXb) 

Ql = (xlxbXc) - {XaXb){XaXc) 

Rl = (XaXbXcXd) - {XaXb){XcXd) , (B2) 



the pointed brackets denote the normalized averages over 
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((■)> = 



na/^^^2^exp{/:-({£°,xn)}(.) 

YlJd^^exp{£-{{i-,x-})} 



(B3) 



and £^ is defined in { {Wj } and the order parameters take on their saddle point values. In the limit n ^ the replicon 
eigenvalue of this matrix equals 



Ai = Pi - 2Qi + Ri 



I 



= / dadbpf^{a, b) 



dE^ 



dadbpf^{a, b) 



lnL(a, b) 



fV{i,x)x^e{x) f fV{i,x)xe{x) 



JV{i,x) 



jT^i^ 



where I'(i, x) is defined in ( |A10|) and L{a, b) is defined in (|l5|). 



The replicon eigenvalue of 



dqabdqad 

dqabdqcd 



is evaluated in the same fashion. We obtain 

P2 for a = c,b = d 

Q2 for exactly one pair of indices equal , 

i?2 for a ^ c.b ^ d 



where 



{xlxl) - {XaXb){XaXb) 



P2 

Q2 = {XlXbXc) - {XaXb){XaXc) 
i?2 = {XaXbXcXd) ~ {XaXb){XcXd) , 

In the limit n — > the replicon eigenvalue of this matrix equals 



A2 = P2 - 2Q2 + i?2 = / dadbp~^{a, b) 

n 2 



^ V{x,x)SP' f J 'D{x,x)x 



JV(S:,x) 



JV{i,x) 



dadbpii{a,b) 



dv 



h\L{a,b) 



The matrix a ^ a- also consists of three different entries. These are 
aqaboqad 



dqabdqcd 



P3 for a = c,b = d 

Q3 for exactly one pair of indices equal , 

i?3 for a ^ c,b y^ d 



where 



P3 = -{XaXbXaXb) + {XaXb){XaXb) 
R3 = -{XaXbXcXd) + {XaXb){XcXd) ■ 

In the limit n ^ the replicon eigenvalue of this matrix equals 

/ I?(a;, x)xx<d{x) J ^{x, x)xQ{x) J 'D{x, x)x 



A3 = P3 - 2g3 + Pa = - / dadbpii{a, b) 

T 2 



-n 2 



SV{x,x) 



JV{i,x) jT>{x,x) 



dadbpfi{a, b) 



d' 



dvdE 



lnP(a, b) 



(B4) 



(B5) 



(B6) 



(B7) 



(B8) 



(B9) 



(BIO) 



Since the replicon eigenvectors of these three matrices are parallel, the eigenvalues of ( 24 ) are those of the matrix 

A2 -1 A3 

A3 -1 Ai 
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and we denote the coefficients of replicon- fluctuations as Sq'^, Sq^, Sq'^ , 5q^ . In order to determine tfie criterion for local 
stability of the RS-saddle point we first eliminate the fluctuations in the conjugate order parameters 5q^ and 5q^ near 
the saddle point. From dS/dSqV = and dS/dSq^ = one obtains (5q^ = ^{Sq"" - \3Sqy) and SqV = ^{SqV - XsSq"") 
respectively. 

This allows us to write the matrix of replicon fluctuations in terms of 6q^ and Sq^ only yielding after some algebra 

S^Srs + l{Sq^Sqy)M' f Sn + ^('5') (B12) 

with 



2 ^ "* ^ ' \Sq 



Since the integrals over the variables q^ and q^ are now over a real function, the criterion that the RS ansatz m4) is 
locally stable is that both eigenvalues of M' are negative giving 

T^(AiA2 - (A3 - 1)2) < 



2 



A 

1 ^2^ 



^(AiA2-(A3 + l)')<0. (B14) 

A2 
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